Multiprecision division on small-word parallel processors: Expanded Version

نویسندگان

  • Eric Rice
  • Richard Hughey
چکیده

This paper presents a study of software implementation of division on small-word parallel processors containing word-by-word multipliers. It compares several algorithms for a variety of architectures. This is done by first optimizing each algorithm for the small-word environment and then comparing their performance on simple machine models. The results indicate that a modified Newton-Raphson algorithm is optimal over a wide range of conditions. The features of this algorithm include low-precision arithmetic, early introduction of the dividend, and a method by which accuracy can be inexpensively boosted if necessary by using an added term from the extended NR equation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multiprecision division: Expanded Version

This paper presents a study of multiprecision division on processors containing word-by-word multipliers. It compares several algorithms by first optimizing each for the software environment, and then comparing their performances on simple machine models. While the study was originally motivated by floating-point division in the small-word environment, the results are extended to multiprecision...

متن کامل

Multiprecision Division on an 8-bit Processor

Small processors can be especially useful in massively parallel architectures. This paper considers multiprecision division algorithms on an 8-bit processor (the Kestrel processor, currently in fabrication) that includes a small amount of memory and an 8-bit multiplier. We evaluate several variations of the Newton-Raphson reciprocal approximation methods for use with division. Our final singlep...

متن کامل

Speeding up the Stress Analysis of Hollow Circular FGM Cylinders by Parallel Finite Element Method

In this article, a parallel computer program is implemented, based on Finite Element Method, to speed up the analysis of hollow circular cylinders, made from Functionally Graded Materials (FGMs). FGMs are inhomogeneous materials, which their composition gradually varies over volume. In parallel processing, an algorithm is first divided to independent tasks, which may use individual or shared da...

متن کامل

Mixed Large-Eddy Simulation Model for Turbulent Flows across Tube Bundles Using Parallel Coupled Multiblock NS Solver

In this study, turbulent flow around a tube bundle in non-orthogonal grid is simulated using the Large Eddy Simulation (LES) technique and parallelization of fully coupled Navier – Stokes (NS) equations. To model the small eddies, the Smagorinsky and a mixed model was used. This model represents the effect of dissipation and the grid-scale and subgrid-scale interactions. The fully coupled NS eq...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998